Dynamic Estimation in Computational Vision
Author: Toshio
Abstract
Spatial coherence constraints are commonly used to regularize the problems of reconstructing dense visual fields like depth, shape, and motion. Recent developments in theory and practice show that the local nature of spatial coherence constraints allows us to solve single-frame reconstruction problems efficiently with, for example, multiresolution approaches. While it is reasonable to impose temporal as well as spatial coherence on the unknown for a more robust estimation through data fusion over both space and time, such temporal, multi-frame extensions of the problems have not been as widely considered, perhaps due to the different and severe computational demands imposed by the sequential arrival of the image data. We present here an efficient filtering algorithm for sequential estimation of dense visual fields, using stochastic descriptor dynamic system models to capture temporal smoothness and dynamics of the fields. Theoretically, standard Kalman filtering techniques (generalized for stochastic descriptor systems) are applicable to solving temporally-extended visual field reconstruction problems, but their implementation is practically impossible because of the high dimensionality and because the time-varying nature of such problems requires on-line propagation of large covariance matrices. By exploiting the inherent local spatial structure of the reconstruction problem, however, we have developed filtering techniques that effectively approximate the information form of the Kalman filter. This is achieved by replacing covariance propagation steps with what are essentially low-order spatial model identification steps, in which spatial models with a strictly local support are constructed based on covariance information. In effect, we are decomposing the multi-frame problem into a series of Bayesian single-frame problems, in which the spatial prior model used reflects knowledge from the previous image frames. 
The resulting filtering algorithm requires O(N) memory and O(N) computation per frame of data, where N is the number of pixels in a frame, and the filter can also be implemented in parallel. As low-level visual field reconstruction is often considered to be a front-end in a hierarchical visual processing system and thus might be VLSI-implemented, we have also designed a square root version of the information Kalman filter as an alternative algorithm with a reduced numerical dynamic range. The square root information filter features an efficient, iterative computational structure and is parallelizable as well. Experiments have shown several beneficial effects of our multi-frame formulation applied to the sequential estimation of optical flow. For example, temporal assimilation of the data makes the reconstruction more robust to noise. Also, there are cases where the classic "aperture problem" of motion vision cannot be resolved satisfactorily by spatial regularization alone but is dramatically resolved by the additional temporal coherence constraint.
Thesis Supervisor: Alan S. Willsky
Title: Professor, Electrical Engineering
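The idea of treating each frame as a Bayesian single-frame problem whose prior comes from earlier frames can be illustrated with a minimal sketch. This is not the thesis algorithm: it assumes a 1-D field, direct noisy pixel measurements, a first-difference smoothness penalty, and a crude diagonal approximation of the propagated information in place of the low-order spatial model identification step. The names `solve_tridiagonal`, `fuse_frame`, `meas_var`, `smooth`, and `process_var` are hypothetical. Because every term has local support, the information matrix is tridiagonal and one frame costs O(N) memory and time.

```python
def solve_tridiagonal(lower, diag, upper, rhs):
    """Thomas algorithm: solve a tridiagonal linear system in O(N).

    lower[i] is the entry at (i, i-1) and upper[i] the entry at (i, i+1);
    lower[0] and upper[-1] are unused and should be 0.
    """
    n = len(diag)
    c = [0.0] * n
    d = [0.0] * n
    c[0] = upper[0] / diag[0]
    d[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] - lower[i] * c[i - 1]
        c[i] = upper[i] / denom
        d[i] = (rhs[i] - lower[i] * d[i - 1]) / denom
    x = [0.0] * n
    x[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        x[i] = d[i] - c[i] * x[i + 1]
    return x


def fuse_frame(prior_mean, prior_info, y, meas_var, smooth, process_var):
    """One information-form update for a 1-D field (illustrative only).

    prior_mean, prior_info: per-pixel prior estimate and information weight
    y: per-pixel measurements, with None where no measurement is available
    Returns the new estimate and a diagonal prior information for the
    next frame.
    """
    n = len(prior_mean)
    diag = [0.0] * n
    rhs = [0.0] * n
    for i in range(n):
        # Temporal prior carried over from previous frames.
        diag[i] = prior_info[i]
        rhs[i] = prior_info[i] * prior_mean[i]
        # Measurement information adds to the information matrix and vector.
        if y[i] is not None:
            diag[i] += 1.0 / meas_var
            rhs[i] += y[i] / meas_var
    lower = [0.0] * n
    upper = [0.0] * n
    for i in range(n - 1):
        # Spatial coherence: penalty smooth * (x[i+1] - x[i])**2.
        diag[i] += smooth
        diag[i + 1] += smooth
        upper[i] = -smooth
        lower[i + 1] = -smooth
    estimate = solve_tridiagonal(lower, diag, upper, rhs)
    # ASSUMPTION: crude stand-in for covariance propagation. Treat
    # 1/diag[i] as a per-pixel variance and inflate it by process noise.
    next_info = [1.0 / (1.0 / diag[i] + process_var) for i in range(n)]
    return estimate, next_info


if __name__ == "__main__":
    mean = [0.0] * 5
    info = [0.0] * 5
    frames = [[1.0, 1.2, None, 0.9, 1.1], [1.1, None, 1.0, 1.0, 0.9]]
    for y in frames:
        mean, info = fuse_frame(mean, info, y, meas_var=0.5, smooth=2.0,
                                process_var=0.1)
        print([round(v, 3) for v in mean])
```

Pixels without a measurement are filled in by the spatial term and by the prior from earlier frames, which is the data-fusion effect the abstract describes. The diagonal `next_info` discards the off-diagonal structure that a low-order local spatial model would retain.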
Similar resources
Numerical Simulation of Laminar Convective Heat Transfer and Pressure Drop of Water Based-Al2O3 Nanofluid as A Non Newtonian Fluid by Computational Fluid Dynamic (CFD)
The convective heat transfer and pressure drop of water based Al2O3 nanofluid in a horizontal tube subject to a constant wall temperature condition are investigated by the computational fluid dynamics (CFD) method. The Al2O3 nanofluid at five volume concentrations of 0.1, 0.5, 1.0, 1.5 and 2% is treated as a non-Newtonian power-law fluid and as a Newtonian fluid with experimentally measured properties of density...
Robust H_∞ Controller design based on Generalized Dynamic Observer for Uncertain Singular system with Disturbance
This paper presents a robust H_∞ controller design, based on a generalized dynamic observer for uncertain singular systems in the presence of disturbance. The controller guarantees that the closed loop system be admissible. The main advantage of this method is that the uncertainty can be found in the system, the input and the output matrices. Also the generalized dynamic observer is used to est...
An Efficient Matching Algorithm for Segment-Based Stereo Vision Using Dynamic Programming Technique
An efficient matching method for segment-based stereo vision is proposed. A potential matching graph which describes the connectivity between candidate matching pairs of segments is built. Establishing correspondence is then reduced to a problem of searching for the optimal path that maximizes a similarity measure. The optimal path is found efficiently without the adverse effects of combinatori...
Inverse Problems in Imaging Systems and the General Bayesian Inversion Framework
In this paper, first a great number of inverse problems which arise in instrumentation, in computer imaging systems and in computer vision are presented. Then a common general forward modeling for them is given and the corresponding inversion problem is presented. Then, after showing the inadequacy of the classical analytical and least square methods for these ill posed inverse problems, a Baye...
Computation Optical Flow Using Pipeline Architecture
Accurate estimation of motion from time-varying imagery has been a popular problem in vision studies. This information can be used in segmentation, 3D motion and shape recovery, target tracking, and other problems in scene analysis and interpretation. We have presented a dynamic image model for estimating image motion from image sequences, and have shown how the solution can be obtained from a ...